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1. A Brief History of SARS 

As outlined in Table 26.1 , the first reported case of “atypical pneumonia,’’ now known 
as Severe Acute Respiratory Syndrome or SARS, occurred in Guangzhou, Guangdong 
province, China, on November 16, 2002. Before the end of February 2003, a total of 11 
index cases occurred independently in nine cities of Guangdong Province, which 
forms the early phase of the SARS epidemic. These index cases spread the virus to 
their close relatives and hospital staffs and provided the early demonstration of the res¬ 
piratory transmission mode of the disease. The clinical symptoms of SARS are 
nonspecific. The index cases all began to have fever higher than 38°C and displayed 
common respiratory symptoms, such as cough, headache, and shortness of breath. 

The dynamics of the outbreak was largely shaped by the presence of the so-called 
super spread event (SSE), in which a single patient was shown to spread the virus to a 
large number of contacts. It is the SSEs that triggered the large scale of SARS 
pandemic in China. The first SSE patient is a businessman specialized in fishery 
wholesale. He was treated in three hospitals from January 30, 2003 to February 10, 
2003 and along the way infected at least 78 other individuals including hospital staffs, 
patients, and close relatives and friends. The second SSE individual, who caused the 
major spread of the disease out of Guangdong, was a business lady, native of Shanxi 
province. She went to Guangdong for business in late February and become sick while 
traveling. She went back to her home province and infected eight family members as 
well as five hospital staffs. The spread continued to Beijing when she decided to seek 
better treatment in Beijing. 

The beginning of the global transmission occurred in Metropole Hotel of Hong 
Kong where a professor of nephrology from a Guangdong hospital stayed during a pri¬ 
vate visit. Without knowing, the urologist was infected with SARS-CoV a few days 
before he traveled to Hong Kong. It is later found that he spread the virus to at least 
15 other persons in the hotel and in the hospital where he was treated. Among 
them, five of the hotel contacts continued their international journeys and further trans¬ 
mitted the disease to Vietnam, Singapore, Canada, and other countries. This marks the 
true beginning of a disastrous worldwide pandemic (http://www.who.int/csr/sars/en/). 
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Table 26.1 Chronological Events of the SARS Outbreaks 


Date 

Event 

November 16, 2002 

The first recognized SARS patient, in Foshan, Guangdong 
province, China 

November 16, 2002 
to March 10, 2003 

11 independent index cases in Foshan, Heyuan, Jiangmen, 
Zhongshan, Shunde, Guanzhou, Zhaoqing, Shenzhen, 

Dongguan, China, resulting in more than 50 secondaiy infections 

January 22, 2003 

SARS spreading in Guangdong province 

March 22, 2003 

SARS spreading to Shanxi and Beijing 

February 21, 2003 

SARS spreading to Hong Kong, marking the beginning of the 
global pandemic 

February 28, 2003 

SARS spreading to Vietnam 

March 12, 2003 

WHO global travel alert of the SARS pandemic 

March 14, 2003 

SARS spreading to Canada 

March 6, 2003 

SARS spreading to Singapore 

March 17, 2003 

WHO established a 9-nation/l 1-institute international laboratory 
network 

March 24, 2003 

Coronavims was isolated from SARS patient 

April 4, 2003 

SARS spreading to Philippines 

April 12, 2003 

Full-length genome of SARS-CoV determined 

April 17, 2003 

The international laboratory network announced conclusive 
identification of SARS-CoV as the causative agent 

May 23, 2003 

Detected SARS coronavirus in market animals 

July 5, 2003 

WHO removed the last region from the affected list, effectively 
marking the end of the outbreak 

August 7, 2003 

WHO reported a total of 8096 cases and 774 death covering the 
major 2002—2003 outbreaks 

September 2003 to 
April 2004 

Outbreaks caused by laboratory incidents in Singapore, Taiwan, 
and Beijing 

December 16, 2003 
to January 8, 2004 

Four independent SARS cases in Guangdong, causing mild disease 
with no death 


WHO played a key role in the investigation and control of the SARS outbreak from 
the very beginning. For the first time in history, WHO issued a global travel alert on 
March 12,2003, which greatly reduced the rate of long-distance transmission of the dis¬ 
ease. On March 17, 2003 WHO established a 9-nation/l 1-institute SARS network that 
played a major role in the rapid identification of the causative agent and development of 
diagnostic tests. Thanks to the international effort co-coordinated by WHO, the SARS 
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outbreaks were effectively under control by July 5, 2003. This was the first powerful 
demonstration of the kind of devastation a new infectious disease can cause worldwide 
and the effectiveness of an international organization when it is running at its peak. 

Following the major SARS outbreaks of 2003—2004, there were several minor out¬ 
breaks with much smaller impacts. Between December 2003 and January 2004, four in¬ 
dependent SARS cases were reported in Guangdong, and none of them led to fetal 
infection or widespread transmission. Subsequent epidemiological tracing revealed 
that all cases could be linked to civet trading activities. In addition, there were three lab¬ 
oratory outbreaks in September 2003, December 2003, and April 2004 in Singapore, 
Taiwan, and Beijing, respectively. The most severe outbreak was associated with the 
incident in Beijing that resulted in a total of nine infection cases with one death. 
None of the other two laboratory infections resulted in further spread of the virus. 


2. SARS Coronavirus 

Rapid identification of causative agent in an outbreak caused by unknown pathogen is 
the key for an effective response. However, in the case of SARS outbreak, this was not 
the case. Due to the association of nonspecific clinical symptoms associated with SARS 
patients, several pathogens were initially “identified” as the potential causes of SARS, 
which included Chlamydia , influenza virus, and paramyxovirus. The confusion 
continued until March, 2003 when three laboratories independently confirmed that a 
previously unknown coronavims was the most likely etiological agent of SARS. 

Coronaviruses are enveloped viruses with the largest single-stranded, positive- 
sense RNA genomes currently known, ranging in size from 27 to nearly 32 kb in 
length. Coronaviruses can infect and cause disease in a broad array of avian and 
mammal species, including humans. The name “coronavirus” is derived from the 
Greek word, meaning crown, as the virus envelope appears under electron microscopy 
to be crowned by a characteristic ring of small bulbous structures. Within the virion, 
the ssRNA genome is encased in a helical nucleocapsid composed of many copies of 
the nucleocapsid (N) protein. The lipid bilayer envelope contains three proteins: enve¬ 
lope (E) and membrane (M) protein, which coordinate virion assembly and release, 
and the large spike (S) protein, which confers the virus’s characteristic corona shape 
as well as serves as the principal mediator of host cell attachment and entry via virus- 
and host-specific cell receptors. The size of the SARS-CoV viral particle is approxi¬ 
mately 80—90 nm and its genomic size is around 29.7 kb. The SARS-CoV genome 
contains 14 open reading frames (ORFs) flanked by 5'- and 3'-untranslated regions of 
265 and 342 nucleotides, respectively. While all CoVs carry strain-specific accessory 
proteins encoded by their downstream ORFs, the order of essential genes—the repli- 
case/transcriptase gene, S gene, E gene, M gene, and N gene—is highly conserved. 
Similar to other known coronaviruses, the SARS-CoV genome expression starts with 
two long open reading frames (ORFs), ORFla and ORFlb, which account for two- 
thirds of the genomic capacity, followed by ORFs encoding S, E, M, and N proteins 
(Fig. 26.1). In addition to these conserved core genes in coronaviruses, the SARS-CoV 
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genome contains several accessory genes that are specific to SARS-CoV and their 
encoded products have no homologue to known proteins. Phylogenetic analysis based 
on the most conserved gene ORFlb indicated that SARS-CoV is distantly related to 
the group 2 coronaviruses (now the genus Betacoronavirus) in the family 
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Figure 26.1 Genomic structure of SARS-CoV and bat SL-CoV. The highly conserved genes 
present in all coronaviruses are shown in dark-colored arrows and the betacoronavirus group 
b-specific ORFs in light-colored arrows. The most variable regions are marked with shaded 
boxes. Rp3, HKU3-1, WIV1, and WIV16 were identified from R. sinicus in China; Rml and 
Rfl from Rhinolophus macrotis and Rhinolophus ferrumequinum, respectively, in China; 
BM48-31 from Rhinolophus blasii, in Europe; Tor2 from late-phase patient during 
2002—2003 SARS outbreak; SZ from civet during 2002—2003 SARS outbreak. * The host of 
Rp3 was previously identified as Rhinolophus pearsoni and later corrected to be R. sinicus. ' 
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Coronciviridae, and represents a distinct cluster, named group 2b (now the genus Beta- 
coronavirus group b; Fig. 26.2). 12,13 


3. The Animal Link 

Due to the rapid spread of the disease and the delay in the identification of the causative 
agent, there was no detailed epidemiological tracing done at the beginning of the 
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Figure 26.2 Phylogenetic tree of betacoronavirus group b. The phylogenetic tree is 
generated based on full-length genome sequences of selected SARS-CoVs and bat SL-CoVs 
using the Neighbor-Joining algorithm in the MEGA4 program with a bootstrap of 1000 
replicates. A bat coronavirus BtCoV HKU9 is used as an outgroup. Numbers above branches 
indicate bootstrap values from 1000 replicates. Scale bar, 0.5 substitutions per site. GD01: 
SARS-CoV isolate from early-phase patient during 2002—2003 SARS outbreak; Tor2, BJ01: 
SARS-CoV isolate from late-phase patient during 2002—2003 SARS outbreak; SZ: SARS- 
CoV isolate from civet during 2002—2003 SARS outbreak; GZ0401/02: SARS-CoV isolate 
from patient during 2003—2004 SARS outbreak; and PC4-13, PC4-227: SARS-CoV isolate 
from civet during 2003-2004 SARS outbreak. BtSL-CoV: bat SARS-like CoV. Rp3, HKU3- 
1, WIV, WIV16, and LYRall were identified from R. sinicus in China; Rml from Rhinolo- 
phus macrotis in China; Rfl and YNLC31 C from Rhinolophus ferrumequinum in China; and 
BM48-31 from Rhinolophus blasii , in Europe. 






















606 


Genetics and Evolution of Infectious Diseases 


outbreaks, and it was therefore impossible to trace the origin of the virus. However, 
through retrospective investigation, it emerged that the majority of the early index 
cases were limited in several cities of the Guangdong province and most of them 
have history of contact directly or indirectly with wildlife animals, including handling, 
killing, and selling wildlife animals as well as preparing and serving wildlife animal 
meat in restaurants. 

As these epidemic regions have a unique dietary tradition favoring freshly slaugh¬ 
tered game meat, there is a huge trafficking and trading industry dedicated to live an¬ 
imal trading in specialized market, the “wet market.” Immediately after SARS-CoV 
was identified as the etiological agent of SARS, studies were conducted in those mar¬ 
kets for evidence of SARS-CoV in market animals. One of the earliest and most impor¬ 
tant studies was conducted by a joint team from Hong Kong and Shenzhen in mainland 
China. 1 In this investigation, out of 25 samples collected from market animals, SARS- 
CoV-like viruses were isolated from four out of six masked palm civets (Paguma 
larvata) and one raccoon dog (Nyctereutes procyonoides). Antibodies against 
SARS-CoV were detected in masked palm civets, raccoon dog, and Chinese ferret- 
badgers (Melogale moschata). Genome sequencing indicated that the viruses isolated 
from civets were almost identical to those from human, suggesting a highly possible 
zoonotic transmission of SARS-CoV from animal(s) to human. These data indicated 
that at least three different animal species were infected by a coronavirus that is closely 
related to SARS-CoV. This important study provided the first direct evidence that 
SARS-CoV existed in animals, pointing to an animal link of the SARS outbreaks. 

Although three animals were identified as susceptible to SARS-CoV infection, the 
larger sale volume of civets in comparison to other animals in the market made them 
the target animals of subsequent surveillance studies. The role of civets as a major car¬ 
rier of SARS-CoV in the markets was further confirmed by serological studies 
involving much large samples. 

The most detailed epidemiological data proving a direct civet to human transmis¬ 
sion of SARS-CoV was obtained during the investigation of the second wave of 
SARS outbreaks during December 2003 to January 2004. There were two lines of ev¬ 
idences suggesting a direct transmission. First, all four independent cases had the his¬ 
tory of direct or indirect contact with civets. Second, sequencing analysis indicated that 
sequences derived from human samples were more closely related to those in the civets 
during that period than those from human samples obtained in the major 2002—2003 
outbreaks. 

In summary, based on the previously mentioned study findings, it was concluded 
that the civet to human transmission is a major, if not the only, source of SARS- 
CoV introduction into the human population. 


4. Natural Reservoirs of SARS-CoV 

Natural reservoir refers to the long-term host of the pathogen of an infectious disease. 
It is often the case that hosts do not get the disease carried by the pathogen, or the 
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infection in the reservoir host is subclinical, asymptomatic, and nonlethal. Once 
discovered, natural reservoirs elucidate the complete life cycle of infectious diseases, 
which in turn will help to provide effective prevention and control strategies. 

As stated earlier, it is clear that civets played a pivotal role in the 2002—2004 out¬ 
breaks of SARS in southern China. Culling of civets seemed to be effective in control¬ 
ling further outbreaks in the region. However, the role of civets as a potential natural 
reservoir host was less evident and eventually ruled out by several studies. Serological 
and molecular studies indicated that only civets in the markets were infected with 
SARS-CoV whereas the populations of civets in the wild or on farms were free of ma- 
jor infections. ~ Civets produced overt clinical syndromes when experimentally 
infected with SARS-CoV. Comparative genome sequence analysis indicated that 
SARS-CoVs in civets experienced rapid mutation, suggesting that the viruses were 
still adapting to the host rather than persisting in equilibrium expected for viruses in 
their natural reservoir species. 

Continuing search for the potential reservoir host of SARS-CoV resulted in the 
simultaneous discovery of SARS-like coronaviruses (SL-CoVs) in bats by two inde¬ 
pendent teams in 2005. Using serological and PCR surveillance, both groups discov¬ 
ered that SL-CoVs were present in different horseshoe bats in the genus 
Rhinolophus. Complete genome sequence analysis revealed that bat SL-CoVs 
have an identical genome organization and a nucleotide sequence identity of 
88-92% to SARS-CoV (Fig. 26.1; Table 26.2). Except for the S, ORF3, and ORF8 
gene products, all deduced aa sequences of the other gene products have a sequence 
identity above 93% with those of SARS-CoV. The variable regions between SARS- 
CoV and bat SL-CoV are mainly located in the coding regions for the nonstructural 
protein 3 (Nsp 3), S protein, ORF3, and ORF8, the products of these genes have aa 
sequence identity of 87—95%, 76—78%, 82—90%, and 34—80%, respectively. Among 
the different bat SL-CoVs, the coding regions for these proteins also represent the most 
variable regions. 

The phylogenetic analysis indicated that bat SL-CoVs were grouped in the same 
cluster of SARS-CoV and were only distantly related to other previously known coro¬ 
naviruses (Fig. 26.2). To date, these bat SL-CoVs represent naturally occurring CoVs 
that are most closely related to the SARS-CoVs isolated from humans and civets. 

Analysis of nonsynonymous and synonymous substitution rates in bat SL-CoVs 
suggests that these viruses are not experiencing a positive selection pressure that would 
be expected if horseshoe bats are new host to these viruses. Instead, these data would 

• 27 79 30 

argue that these viruses have been associated with the bat hosts for a long time. 

These observations would support the notion that bats in the genus Rhinolophus are the 
likely natural reservoir hosts of bat SL-CoVs. It can be further postulated that similar 
bat species may serve as natural reservoirs of viruses with closer evolutionary relation¬ 
ship to the viruses that were responsible for the 2002—2004 SARS outbreaks. 

In this context, we and other groups continued the search for the direct progenitor of 
SARS-CoV and made great progress in the last 10 years following the initial discovery 
of SL-CoVs in horseshoe bats. First, highly diverse SL-CoVs have been found not only 
in Chinese but also in European and African bats, indicating a much wider geographic 
distribution and long evolutionary history of SL-CoVs in different bat populations 



Table 26.2 Comparison of Gene Products Between SARS-CoV and Bat SL-CoV 



Gene Product Size (aa) 

Amino Acid Sequence Identity With Tor2/sz3 (%) a 

Gene/ORF 

Tor 2 

SZ3 

Rfl 

Rp3 

Rml 

HKU3-1 

Rsl 

Rfl 

Rp3 

Rml 

HKU3-1 

Rs672 

Pla 

4382 

4382 

4377 

4380 

4388 

4376 

4189 

94 

96 

93 

94 

94 

Plb 

2628 

2628 

2628 

2628 

2628 

2628 

2628 

98 

99 

98 

98 

99 

nsp3 b 

1922 

1922 

1917 

1920 

1928 

1916 

1729 

92 

95 

90 

92 

87 

S 

1255 

1255 

1241 

1241 

1241 

1242 

1241 

76 

78 

78 

78 

79 

SI 

680 

680 

666 

666 

666 

667 

666 

63 

63 

64 

6 

64 

S2 

575 

575 

575 

575 

575 

575 

575 

92 

96 

96 

94 

96 

ORF3a 

274 

274 

274 

274 

274 

274 

274 

86 

83 

83 

82 

90 

ORF3b 

154 

154 

113 

56 

56 

39 

114 

89 

NA 

NA 

NA 

97 

ORF3c 

NP 

NP 

32 

NP 

NP 

NP 

NP 

NA 

NA 

NA 

NA 

NA 

E 

76 

76 

76 

76 

76 

76 

76 

96 

100 

98 

100 

100 

M 

221 

221 

221 

221 

221 

221 

221 

97 

97 

97 

99 

99 

ORF6 

63 

63 

63 

63 

63 

63 

63 

93 

92 

92 

94 

98 

ORF7a 

122 

122 

122 

122 

122 

122 

122 

91 

95 

93 

94 

96 
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ORF7b 

44 

44 

44 

44 

44 

44 

44 

90 

93 

93 

93 

93 

ORF8a 

39 

NP 

NP 

NP 

NP 

NP 

NP 

NA 

NA 

NA 

NA 

NA 

ORF8b 

84 

NP 

NP 

NP 

NP 

NP 

NP 

NA 

NA 

NA 

NA 

NA 

ORF8 

NP 

122 

122 

121 

121 

121 

121 

80 

35 

35 

34 

36 

N 

422 

422 

421 

421 

420 

421 

422 

95 

97 

97 

96 

99 

ORF9a 

98 

98 

96 

97 

97 

97 

98 

81 

85 

90 

88 

92 

ORF9b 

70 

70 

70 

70 

70 

70 

70 

80 

91 

91 

88 

94 


NP, not present; NA, not applicable. 

a Tor2 was used for all homology calculations with the exception of ORF8, which is absent in Tor2, the SZ3 was used instead. 
b The region of nsp3 is highly variable and was calculated alone. 
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(Table 26.2). 11 Second, great genetic diversity of SARS-CoVs were discovered in 
one particular population of R. sinicus in China by a longitudinal surveillance. 
Third and most importantly, two SL-CoV strains were isolated in Vero cells. These 
two isolates are closely related to the progenitor of the SARS-CoV not only in genomic 
sequences but also in receptor usage 4 (Figs. 26.1—26.3; Table 26.1). 


5. Molecular Evolution of SARS-CoV in Humans and 
Animals 

Analysis of the large number of SARS-CoV and SL-CoV sequence datasets accumulated 
since 2004 has clearly demonstrated the importance of vims evolution in cross-species 
transmission and in pathogenesis. The following is a summary of the major evolutionary 
findings in host switching, recombination, and vims—receptor interactions. 


5 .1 Rapid Adaptation of SARS-CoVs in Humans 

On the basis of the epidemiological data, the Chinese SARS molecular epidemiology 
consortium divided the course of the 2002—2004 outbreaks into three stages, the early, 
middle, and late phases, respectively. The early phase is defined as the period from the 
first emergence of SARS to the first documented SSE. The middle phase refers to the 
ensuing events up to the first cluster of SARS cases in a hotel (Hotel M) in Hong Kong, 
while cases following this cluster fall into the late phase. 

Analysis of all the viral sequences available from human patients and animals 
revealed two major hallmarks of rapid vims evolution during the initial stages of 
the 2002—2003 outbreaks: (1) All isolates from early patients and market animals con¬ 
tained a 29-nucleotide (nt) sequence in ORF8 that is absent in most of the publicly 
available human SARS-CoV sequences derived from later phases of the outbreaks; 
(2) characteristic motif of single-nucleotide variations (SNVs) were identified in 
SARS-CoVs of different phases and all these SNVs were located in the S gene that 
codes for the spike protein responsible for attachment to the host cellular receptor. 
All SARS-CoV isolates from epidemic countries and regions outside mainland China 
could be traced to Guangdong or Hong Kong based on the S-gene SNV motif. 

During the second sporadic outbreaks of 2003—2004, it was shown that the SARS- 
CoV sequences from index patients were almost identical to that from civets collected 
in the same period and all retained the 29-nt sequence in the ORF8 gene. The mild dis¬ 
ease symptoms associated with these vimses and the lack of rapid human-to-human 
transmission provided further evidence that the rapid adaptation of the SARS-CoV 
in the first major outbreak of 2002—2003 was essential for its establishment and path¬ 
ogenesis in humans. 

With the available genomic variation data and the sampling time, it is now possible 
to calculate the neutral mutation rate and to estimate the date for the most recent com¬ 
mon ancestors (MRCAs) of SARS-CoV. The estimate obtained is around 
8.00 x 10 6 nt 1 day ', suggesting that SARS-CoV evolves at a relatively constant 
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Figure 26.3 Alignment of amino acid sequences covering the receptor-binding motif from viruses of different species origin. GDO 1 : S ARS-CoV 
isolate from early-phase patient during 2002—2003 SARS outbreak; Tor2, BJ01: SARS-CoV isolate from late-phase patient during 2002—2003 
SARS outbreak; SZ: SARS-CoV isolate from civet during 2002—2003 SARS outbreak; GZ0402: SARS-CoV isolate from patient during 2003—2004 
SARS outbreak; and PC4-227: SARS-CoV isolate from civet during 2003—2004 SARS outbreak. * indicates the two key residues 479 and 487. Rp3, 
HKU3-1. WIV, WIV16, and LYRall were identified from R. sinicus in China; Rml from Rhinolophus macrotis in China; Rfl and YNLC31C from 
Rhinolophus ferrumequinum in China; and BM48-31 from Rhinolophus blasii, in Europe. 
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neutral rate both in humans and palm civet. From these calculations, it was estimated 
that the MRCAs for palm civet and humans of different transmission lineages lie in 
mid-November 2002. This estimate was consistent with the first observed SARS 
case around November 16, 2002 in Foshan, Guangdong. 

5.2 Generation of Viral Genetic Diversity by Recombination 

At the present time, at least 33 full-length genome sequences of bat SL-CoVs were 
determined. Shown in Fig. 26.1 is a comparison of the genome struc¬ 

tures for seven selected bat SL-CoVs and one each of civet and human SARS-CoV 
isolates. All bat SL-CoVs, with the exception of HKU3-8 and BM48-31, 32 contain 
the 29-nt sequence in ORF8, which is present in SARS-CoV from early-phase patients 
and civets, indicating the common ancestor between civet SARS-CoV and bat SL- 
CoV. The SL-CoV HKU3-8 contained a 26-nt deletion that is located 14 nt down¬ 
stream from the commonly observed 29-nt deletion, and the BM48-31 completely 
lost the ORF8, indicating that the ORF8 coding region is a “hotspot” for deletions. 

SL-CoVs from different bat species share 88—97% nt identity among themselves, 
indicating that the genetic diversity of SL-CoVs in bats is much greater than that 
observed among civet or human isolates. The most dramatic sequence difference be¬ 
tween human SARS-CoV and bat SL-CoV is in the S protein that has 76—97% aa 
identity for the whole S protein and 64—95% aa identity for the N-terminal region 
(or the SI region; Table 26.2). This great genetic diversity observed among bat SL- 
CoVs and the major difference between the SI regions of SL-CoV and SARS-CoV 
S proteins clearly demonstrated that bats are natural reservoirs of human SARS-CoV. 

It is well documented that the positive-sense ssRNA genomes of coronaviruses are 
prone to homologous recombination during coinfection of different coronaviruses and 
that recombination plays an important role in generating new coronavirus species, in 
facilitating cross-species transmission and in modulating virus virulence. 

Several studies provided evidence for coinfection and recombination came from anal¬ 
ysis of SL-CoVs in bats. It was further revealed that recombination can occur at 

multiple sites along the SL-CoV genome. For example, detailed sequence 

analysis of two genotypes of bat SL-CoV, Rp3 and Rs672 (both were identified from 
R. sinicus), suggested that they may represent a recombinant of two bat SL-CoVs and 
one of them is more closely related to the human SARS-CoVs. During 2015 and 
2016, two teams reported a full-length ORF8 that shares higher sequence similarities 
to the SARS-CoV GZ02 and civet SARS-CoV SZ3 than previously detected SL- 
CoVs. These results suggest that SARS-CoV most likely originated from different 
bat SL-CoVs via a complicated evolutionary path that involved recombination events. 

5.3 Receptor Usage and Evolutionary Selection 

The S protein of coronavirus is responsible for attachment to cellular receptor to 
initiate the first step of virus infection. The angiotensin-converting enzyme 2 
(ACE2) was identified as a main functional receptor for SARS-CoV. Further analysis 
demonstrated that the region covering aa 318—520 of S protein is the key receptor- 
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binding domain (RBD), which is both essential and sufficient to bind the human ACE2 
molecule in vitro. Detailed analysis of the crystal structure of the RBD—ACE2 com¬ 
plex revealed that 19 key residues have close contact with the receptor molecule, 
which are located from aa 424 to 474. This region is termed the receptor-binding motif 
(RBM). 47 

When the existing epidemiological data was analyzed in combination with the data 
on infectivity of SARS-CoV isolated in humans at the different phases of the outbreaks 
and SARS-CoV isolates in civets, a clear correlation could be established between the 
evolution of the S proteins and virus infectivity. It was observed that the S protein is 
the fastest evolving protein of SARS-CoV during interspecies transmission from ani¬ 
mal to human and in the following phases of human to human transmission. The ma¬ 
jority of the mutations are located in the SI domain (31 of a total of 48 SNVs), 
particularly in the RBD. The interaction analysis between the S proteins of different 
isolates and the ACE2 molecules demonstrated that two aa residues in the S protein, aa 
479 and aa 487, played a key role in virus infectivity. For aa residue 479, all 
2002—2003 human isolates contain asparagine (N). The palm civet isolates seem to 
have variable aa residues at this position, all 2002—2003 and some 2003—2004 civet 
isolates have lysine (K) while other 2003—2004 isolates have either asparagine (N) or 
arginine (R). For aa residue 487, all isolates including those from early- and middle- 
phase patients, civets of 2002—2003 and 2003—2004, have a codon for serine (S), 
whereas all isolates from 2002—2003 late-phase human patients have a codon for thre¬ 
onine (T) (Fig. 26.3). When examined using an HIV-based pseudovirus infection 
assay, S proteins with all combinations of residues 487/479 could efficiently use the 
civet ACE2 as an entry receptor, but showed different infectivity in human ACE2- 
mediated infection. The combination of N479/T487 had the highest infectivity, 
N479/S487 medium infectivity, and K479/S487 the lowest, which almost abolished 
the infection. These results demonstrated elegantly at the molecular interface that 
the rapid evolution of the S protein, especially at the aa positions important for host 
receptor engagement, was essential for the adaptation to and establishment of an effec¬ 
tive and productive human infection. 

When the genome sequences of SL-CoVs were analyzed, it became evident that the 
N-terminal regions of their S proteins are the most divergent among themselves, as 
well as with the SARS-CoV. As shown in Fig. 26.3, bat SL-CoVs can be grouped 
into three groups based on the RBM sequences. The strains discovered early are close 
to each other and have a major sequence difference involving deletions of 17—18 aa 
right in the middle of RBM. We have since demonstrated experimentally that SL- 
CoV S proteins are unable to use ACE2 molecule, regardless of its origin, as a func¬ 
tional receptor. The second group, identified from European bats, has deletions of 4 
aa. The third group, discovered recently, has no deletion and contains an identical 
size as the SARS-CoV in the S protein (Fig. 26.3).’ 1 As predicted from their S 
sequences, three isolates from the third group, SL-CoV—WIV1, WIV16, and 
SHC014, have been shown to be able to use ACE2 for cellular entry, even though these 
S proteins still have slight difference at the key aa involved in direct interaction with 
ACE2. Most importantly, the SHC014 can replicate well in transgenic mice 

containing human ACE2, and it caused tissue damage in tested animals. 



614 


Genetics and Evolution of Infectious Diseases 


6. Coronavirus Surveillance in Wildlife Animals 

Zoonosis contributes to the majority of emerging disease in the last 30 years, many of 
them originated from wildlife animals. The story of SARS is just one of such ex¬ 
amples that spectacularly demonstrated the seamless evolution of a bat virus into a hu¬ 
man pathogen responsible for one of the most severe global pandemic outbreaks in 
modern history of mankind. In general, pathogens carried by wildlife reservoir animals 
usually do not cause clinical symptoms and they lie dormant until they spill over into 
and cause diseases in domestic animals or humans. Classical outbreak response mea¬ 
sures, such as those deployed during the SARS outbreaks, are still useful, but no longer 
sufficient for early detection and prevention of major infectious disease outbreaks in 
the 21st century. 

With the demonstration of an increasing number of spillover events that led to severe 
disease outbreaks in human and domestic animals, we believe it is paramount that from 
now on we include active surveillance of wildlife animals as part of an integrated infec¬ 
tious disease prevention and control strategy. Surveillance of wildlife animals has also 
been made more feasible and productive, thanks to the advance in modem molecular 
techniques including PCR with virus group-specific primers, vims discovery using 
next generation high-throughput sequencing technologies, and high density vims 
microarrays. 56 Since the SARS outbreaks, especially after the discovery of SL- 
CoVs in bats, there is a significant surge in international effort for surveillance of 
coronavimses in wildlife animals. Before the SARS outbreak, there were only 10 coro- 
navimses with complete genomes sequenced. This number has increased more than 
sixfold as a result of the active surveillance works conducted around the 
world. 27-29,31 ’ 32 ’ 34 ' 40 ’ 55 ' 64-73 Although this only marks the beginning of our under¬ 
standing of coronavimses in wildlife animals, it is fair to say that we have learnt a lot 
more about coronavimses in general than the past 50 years or so; during that period 
studying of vimses was only possible and called for in response to disease outbreaks. 
Based on phylogenetic analysis of the large number of bat coronavirus sequences avail¬ 
able presently, it is postulated that all known disease-causing coronavimses previously 
identified in humans or animals originated from bat strains. This hypothesis 

was unfortunately proved by the outbreak of another SARS-like disease, Middle East 
Respiratory Syndrome (MERS), which was caused by a novel coronavims (previously 
named HCoV-EMC, now MERS-CoV) and supposed to originate from bats. Even 
though the MERS-CoV-like vimses found in bats are not the direct progenitor of the 
MERS-CoV, the highly genetic diversity of these bat vimses is likely the gene sources 
for the deadly pathogen in humans, just like that for SARS-CoV. 


7. Concluding Remarks 

The emergence of SARS-CoV has had a huge impact on the global health and economy. 
It served as a warning to what may come out of a seemingly harmless virus—reservoir 
equilibrium in bats or any other wildlife species. At the same time, the experience 
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gained from the SARS outbreaks and the following in-depth studies on SARS-like 
coronaviruses has provided and will continue to provide invaluable knowledge and 
guideline to our future fight against new and emerging infectious diseases. One of 
the major lessons is that we need to pay much more attention to the reservoir species 
in understanding the genetic diversity of different viruses, the intricate interplay at 
the virus—host interface, and the major factors responsible for the disturbance of virus- 
host equilibrium, which in turn trigger spillover events leading to disease outbreaks. 
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